On individual neutrality and collective decision making 
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Abstract 

We derive a simple mathematical "theory" to show that two decision-making entities 
can work better together only if at least one of them is occasionally willing to stay 
neutral. This provides a mathematical "justification" for an age-old cliche among 
marriage counselors. 
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1 Introduction 



Suppose that, for each decision one has to make, there are two possible courses of action, a 
good one (£?) and a bad one (B). Let h represent a decision-making agent, where h(Q) > h(B) 
means h favors the good action; h(Q) = h(B) means h is neutral; and h(Q) < h(B) means 
h favors the bad action. In addition, suppose that the payoffs associated with favoring the 
good action, being neutral, and favoring the bad action are +1, 0, and —1, respectively. Let 
V(h) denote the expected payoff for h. Then, 

V(h) = (+1) x P(h{Q) > h{B)) + x P(h{Q) = h{B)) + (-1) x P{h{Q) < h{B)). (1) 
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Clearly, V(h) is a measure of the tendency for h to favor the good action: V(h) > means 
h is more likely to take the good action; V(h) < means it is more likely to take the bad 
one; and V(h) = means it essentially acts randomly. 

Let hi and h 2 be two decision-making agents, e.g., a husband and wife. In such settings, 
it is natural to require (e.g., for concerns of fairness) that neither hi nor h% be allowed to 
dominate the collective decision. This means their respective votes must be on the same 
scale and equally weighted. Without loss of generality, we assume that both hi and hi are 
binary functions, with hi(x) = 1 meaning that hi voted for action x and hi(x) = meaning 
that hi voted against action x. The collective decision is represented by their mean, 

t h + h 2 

h = . 

2 

Definition 1 The collective decision h is said to be synergistic if 

it is said to have positive synergy if the above inequality is strict. 

In what follows, we derive "if and only if" conditions for the collective decision h to be 
synergistic and examine the implications of those conditions. Before we proceed, however, it 
is first necessary for us to explain why our definition of synergy above should depend on the 
specific payoff structure (+1,0, —1) that gave rise to the expression V(h) in equation ([!]). 



1.1 Connection to existing literature: ROC curves 



Our choice o 



curve (IPepe 



V(h ) is deeply related to the so-called receiver-operating characteristic (ROC) 



20031 ). a concept widely used in many scientific fields such as signal detection 



and medical diagnostic tests. 

In the context of medical diagnostic tests, let h(x) be a measurement taken on the patient 
x. Suppose the test declares x to be "sick" if h(x) < c for some threshold c, and "healthy" 
if h(x) > c. The test is not perfect. For a given c, some healthy patients are declared "sick" 
(false positives), while some sick patients are declared "healthy" (false negatives). 

It is easy to see that decreasing the threshold c will always reduce the false positive 
rate but increase the false negative rate of such a test, while increasing the threshold c will 
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always reduce its false negative rate but increase its false positive rate. For example, in 
the extreme case of c = — oo, nobody is declared "sick" so there can be no false positives, 
but the false negative rate reaches 100% since everyone is declared "healthy" including all 
truly sick individuals. The situation is similar but reversed in the other extreme case of 
c = +00. There is an inherent trade-off between the two types of errors — reducing one 
always increases the other, and the optimal choice of c depends on their relative costs. 

The ROC curve essentially traces the two types of errors over the entire range of c, and 
the area under the ROC curve, or simply "area under the curve" (AUC), is a commonly used 
performance metric for evaluating these diagnostic tests. The AUC has the advantage of not 



depending on the relative costs of the t wo types of errors 



interesting probabilistic interpretation (IHanley and McNeil 



n add i tion, it has the following 



1982 



Pepd 120031 ): Suppose % 



denotes the set of all healthy patients, and S denotes the set of all sick patients. Then, 

AUC(/i) = 1 x P{h{Q) > h(B)) + - x P{h(G) = h(B)) + x P(h(G) < h{B)) (3) 

for any randomly chosen Q G H and B G S. If P(h(Q) = h(B)) = 0, meaning h does not 
produce ties between healthy and sick patients, this is simply the "folklore" statement that 
AUC(/i) is the probability that the test h correctly orders healthy patients ahead of sick 
ones. The factor "1/2" means ties are broken at random. 

It is easy to see from (JTJ and (j3J) that 

V{h) = 2AUC(/i) - 1. 

In other words, our expected payoff V(h), as defined in flTJ, is merely a linearly transformed 
version of the widely-used AUC for evaluating the effectiveness of medical diagnostic tests. 
One can think of V(h) as measuring the ability of h to "diagnose" the good action when 
faced with a decision. 



2 Results 

We now state two lemmas for respectively the independent and dependent cases. Together, 
they establish necessary and sufficient conditions for the collective decision h to be syner- 
gistic. Proofs of the two lemmas are given in the appendices. However, the punchline of the 
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paper, which we state in Section [3j is a consequence of these two lemmas, rather than the 
two lemmas themselves. 



2.1 Independent case 

First, suppose hi and h 2 act independently. For % = 1,2, let 

a, = P(h i (G)>h i (B)), 
h = P(hi(Q) = hi(B)), 

a = P(hi(g)<hi(B)). 

In other words, is the probability that hi favors the good action; 6, is the probability that 
hi is neutral; and is the probability that hi favors the bad action. 

Lemma 1 Suppose hi and h 2 are statistically independent. Then, h is synergistic if and 
only if 

bi(a 2 -c 2 ) + b 2 (ai - Cl ) > 0. (4) 

Corollary 1 For h to be synergistic, it is sufficient (but not necessary) for both a\ > c\ and 
a 2 > c 2 . 

Since V(hi) — a\ — c% by ([T]) and likewise for h 2 , the conclusion here is simply this: the 
collective decision is synergistic as long as both decision makers are "no worse than random" , 
having a slightly higher chance of favoring the good action rather than the bad one. 



2.1.1 Connection to existing literature: Weak learners 



'he notion of being "no worse than random" is analogous to that of a "weak learner" 



(lSchapire!ll990t). an important concept i n PAC learning theory (]Vafiantlll984l ) and the theory 



of boosting (IFreund and Schapirdll996l ). 



2.2 Dependent case 

More generally, suppose that hi and h 2 are not independent. For clarity, we make a small 
change in the notation to describe this case, using lowercase letters a, b, c for the marginal 
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probabilities of hi, and uppercase letters A, B, C for those of h 2 : 



a = P{h x {Q) > hi(B)), A = P(h 2 (g) > h 2 {B)); 
b = P(h{g) = h x {B)), B = P(h 2 (g) = h 2 (B)); 

c = p{h!{g) < /11(B)), c = P{h 2 (g) < h 2 {B)). 

Furthermore, since hi and h 2 are dependent, we use P x x to refer to the conditional prob- 
abilities of h 2 given hi, and Qy y to refer to the conditional probabilities of hi given h 2 , as 
displayed in Table [H These notations for P and Q are analo gous to those typically used to 
denote transition probabilities for Markov chains (IRossI 119971 ). 



Of course, P and Q are related by Bayes theorem (jBayea Il763l ) . For example 



p{h 2 {g) = h 2 {B)\hi{g)<hi{B)) = ... 

p(h(g) < hi(B)\h 2 (g) = h 2 (B)) x p(h 2 (g) = h 2 (B)) 



or, using the compact notations of Table dj 

Qbc x B 



cB 



p{hi(g) < hi{B)) 



or cP cB = BQ Bc . 



(5) 



This leads to the following proposition, which is needed in order to prove Lemma [2] that 
follows. 

Proposition 1 For x = a,b,c and Y = A,B, C , xP x y = YQ Yx - 

Lemma 2 Suppose hi and h 2 are statistically dependent, with conditional probabilities given 
by Tabled Then, h is synergistic if and only if 



b (P bA - P b c) + B (Q Ba - Q Bc ) > 0. 



(6) 



Corollary 2 For h to be synergistic, it is sufficient (but not necessary) for both PbA > PbC 
and Q Ba > Qbc- 



Notice the similarity and symmetry of the two "if and only if" conditions, 01]) and They 
can both be interpreted as follows: "Whenever one decision maker is neutral, the other one 
is more likely to favor the good action rather than the bad one." Clearly, this is a highly 
intuitive characterization of the notion of synergy. 
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Table 1: Conditional probabilities. (I) P x x denotes various conditional probabilities of h 2 
given h\. (II) Qy y denotes various conditional probabilities of hi given h 2 . For example, 
P aB = P(h 2 (G) = h 2 {B)\h{g) > h{B)), Q Bc = P{h x {g) < h{B)\h 2 {g) = h 2 {B)), etc. 

(I) 





h 2 {G) > h 2 (B) h 2 {Q) = h 2 {B) h 2 {Q) < h 2 (B) 


h 1 (g)>h 1 (B) 
h 1 (g) = h 1 (B) 
h 1 (g)<h 1 (B) 


PaA PaB PaC 
PbA PbB PbC 
PcA PcB PcC 


(ii) 




h 1 (g)>h 1 (B) h 1 (g) = h 1 (B) h(g)<h(B) 


h 2 {Q) > h 2 (B) 
h 2 (g) = h 2 (B) 
h 2 (G) < h 2 {B) 


QAa QAb Qac 
QBa QBb Qbc 
Qca Qcb Qcc 



3 Conclusion 

It is easy to see that the inequality (J3J) will become an equality when bi = b 2 = 0. The same 
can be said for inequality ([6]) when b = B = 0. These cases correspond to the situation 
where neither hi nor h 2 is ever neutral about a decision. 

Definition 2 A decision-making agent h is said to be opinion-loaded if its probability of 
being neutral is zero, i.e., if P(h(g) = h(B)) = 0. 

Theorem 1 There can be no positive synergy in the collective decision h if both decision- 
making agents are opinion-loaded. 

Our results, therefore, imply that there can be no positive synergy in the collective 
decision without individual neutrality. In other words, the willingness to compromise is not 
an option; it is a necessity! 

Intuitively, this is because opinion-loaded decision-makers are never willing to admit 
that they may sometimes have a hard time making a good decision. As a result, they don't 
give the other decision-maker a chance to take over the decision when it can be beneficial to 
do so; this explains why there can be no positive synergy. In order for there to be positive 
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synergy at all, it is necessary for at least one agent to have a strictly positive probability of 
remaining neutral. 

For centuries, marriage counselors must have been giving such advice to couples all over 
the world, but they may not be aware of the mathematical justification for their age-old 
practice. 



A Proof of Lemma U 

Since both h\ and h 2 are binary functions, h{G) > h(B) if and only if 

(i) htiQ) > h(B) and h 2 {G) > h 2 (B); or 

(ii) h^Q) > h(B) and h 2 (G) = h 2 (B); or 

(iii) hxiQ) = hxiB) and h 2 {Q) > h 2 (B). 

That hi and h 2 are statistically independent means 

P{h{Q) >h{B)) =a 1 a 2 + a 1 b 2 + a 2 b 1 . (7) 

Likewise, 

P(h(Q) <h(B)) = cic 2 + cib 2 + c 2 bi. (8) 
Using the definition ([I]) and (JTJ)- (JHD , we get 
V(h) - 1/ <"'> ± V{h ^ 

( a i — Ci) + (a 2 — c 2 ) 

= [{aia 2 + aib 2 + a 2 bi) - [Cic 2 + c x b 2 + c 2 bi)\ 

= aia 2 -c 1 c 2 +(b 2 -^{ai-c x )+{bi-^\{a 2 -c 2 ). (9) 
Since + 6j + q = 1 for i = 1, 2, dHJ) becomes 

a x a 2 - cic 2 + Q - a 2 - c 2 ^j (a x - ci) + Q - ai - (a 2 - c 2 ) 

, — C 2 . . . . Cti — Cx . 

= (oia 2 - cic 2 ) H (a 2 + c 2 )(ai - ci) H (ai + ci)(a 2 - c 2 ) 

a 2 - c 2 ai - ci , v 
= — g 1 2 ( a i a 2-cic 2 ) 

= ^ [(a 2 - c 2 ) + (ai - ci) - 2(aia 2 - cic 2 )], 

(*) 
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where 

(*) = [(02 - Pa) - (0102 - cica)] + [(ai - ci) - (aia 2 - cic 2 )] 

= [(1 -ai)a 2 - (1 -ci)c 2 ] + [(1 -a 2 )a 1 - (1 - c 2 )d] . (10) 

Finally, using the fact that dj + 6, + Cj = 1 for % = 1, 2 again, we see that ({TO]) is equal to 

[(61 + ci)a 2 - (at + b 1 )c 2 ] + [(62 + c 2 )ai - (a 2 + & 2 )c x ] 
= 61 a 2 - b x c 2 + b 2 a x - b 2 c x 
= 61 (a 2 - c 2 ) + b 2 (a 1 - ci), 

which proves the lemma. □ 



B Proof of Lemma [2 



By the same argument used to start the proof of Lemma [TJ 

v(h) - YM+XM 

= {(aP aA + aP aB + bP bA ) - (cP cC + cP cB + bP bC )] - (a ~ C) + 2 (A ~ C) . (li; 

But 

A = aP aj4 + bP bA + cP cA and C = aP aC + bP bC + cP cC , 
so (|TT|) is equal to 



(aP ayi + aP aB + bP bA ) - (cP cC + cP cB + bP bC ) - ... 

(a - c) (aP aA + 6P M + cP cA ) - {aP aC + bP bC + cP cC ) 



2 2 
Gathering terms multiplying a, 6, and c, respectively, this becomes 

nlP +P 1 + ~ Pa ° \ \ h( P P PbA — Pb ° 
" I faA + r a B " / V ~~ 2 



■ ■■-c(P cC + P c b- 1 " jPc 1 + PcC ). (12) 



Using the fact that 

PaA + PaB + PaC = 1 and P cC + P cB + P cC = 1 , 



expression (|T2|) can be simplified to 



a 



2 



+ b 



( 



PbA — PbC 
2 



) 



— C 



2 



(13) 



However, by Proposition [TJ we have 



aP aB = BQbcl and cP cB = BQ Bc . 



Substituting this into (fTBl leads to the conclusion that ( TTT|) > if and only if 



b (PbA - Pbc) + B (Q Ba - Q Bc ) > 0. □ 
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